
Conversation

Contributor

@Qard Qard commented Jan 12, 2026

This change allows users to configure which model to use as the default for all evaluations, replacing the hardcoded gpt-4o default.

Changes:

  • Add defaultModel parameter to init() in both JS and Python
  • Add getDefaultModel() function to retrieve configured default model
  • Update LLMClassifier and RAGAS scorers to use configurable default model
  • Update documentation with examples for different use cases

This enables:

  • Using different OpenAI models (gpt-4-turbo, o1, gpt-3.5-turbo, etc.), as sketched right after this list
  • Using non-OpenAI models via Braintrust proxy (Claude, Gemini, Llama, etc.)
  • Configuring once and having all evaluators use the preferred model
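
As a minimal sketch of the first case (switching the default to another OpenAI model): this assumes `init()` can be called with only `defaultModel` and no custom client, keeping the standard OpenAI client and OPENAI_API_KEY handling. The proxy/Claude case is shown under "Example usage" below.

```javascript
import { init } from "autoevals";

// Assumption: omitting `client` keeps the default OpenAI client setup;
// only the default evaluation model changes for all evaluators.
init({ defaultModel: "gpt-4-turbo" });
```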

Example usage:

```javascript
import { init } from "autoevals";
import OpenAI from "openai";

init({
  client: new OpenAI({
    apiKey: process.env.BRAINTRUST_API_KEY,
    baseURL: "https://api.braintrust.dev/v1/proxy",
  }),
  defaultModel: "claude-3-5-sonnet-20241022",
});
```
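
A follow-up sketch of how the configured default is read back and picked up by a scorer: `getDefaultModel()` is the accessor added in this change, and the fallback behavior shown for `Factuality` (an LLMClassifier-based scorer) when no `model` option is passed is assumed from the description above, not a verified signature.

```javascript
// Run in an ES module / async context (top-level await).
import { getDefaultModel, Factuality } from "autoevals";

// Accessor added by this PR: returns the model configured via init().
console.log(getDefaultModel()); // "claude-3-5-sonnet-20241022"

// Assumption: with no `model` option, the scorer falls back to the
// configured default instead of the previously hardcoded gpt-4o.
const result = await Factuality({
  input: "What is the capital of France?",
  output: "Paris",
  expected: "Paris is the capital of France.",
});
console.log(result.score);
```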

Fixes #136

@Qard Qard requested a review from ibolmo January 12, 2026 23:54
@Qard Qard self-assigned this Jan 12, 2026
@Qard Qard requested a review from ankrgyl January 12, 2026 23:54

github-actions bot commented Jan 12, 2026

Braintrust eval report

Autoevals (model-flexibility-1768324125)

| Score | Average | Improvements | Regressions |
| --- | --- | --- | --- |
| NumericDiff | 73.4% (+1pp) | 3 🟢 | 1 🔴 |
| Time_to_first_token | 1.33tok (-0.06tok) | 85 🟢 | 33 🔴 |
| Llm_calls | 1.55 (+0) | - | - |
| Tool_calls | 0 (+0) | - | - |
| Errors | 0 (+0) | - | - |
| Llm_errors | 0 (+0) | - | - |
| Tool_errors | 0 (+0) | - | - |
| Prompt_tokens | 279.25tok (+0tok) | - | - |
| Prompt_cached_tokens | 0tok (+0tok) | - | - |
| Prompt_cache_creation_tokens | 0tok (+0tok) | - | - |
| Completion_tokens | 19.3tok (+0tok) | - | - |
| Completion_reasoning_tokens | 0tok (+0tok) | - | - |
| Total_tokens | 298.54tok (+0tok) | - | - |
| Estimated_cost | 0$ (+0$) | - | - |
| Duration | 3.14s (-0.42s) | 140 🟢 | 79 🔴 |
| Llm_duration | 2.58s (-0.22s) | 106 🟢 | 13 🔴 |

@Qard Qard force-pushed the model-flexibility branch 3 times, most recently from 48320a5 to 91f19f8 on January 13, 2026 00:00
Collaborator

@ibolmo ibolmo left a comment


LGTM!

@Qard Qard force-pushed the model-flexibility branch 3 times, most recently from 7d7b9da to 7f8c1fd on January 13, 2026 00:33
@Qard Qard requested a review from ibolmo January 13, 2026 00:35
@Qard Qard force-pushed the model-flexibility branch from 7f8c1fd to d616f67 on January 13, 2026 17:08
@Qard Qard merged commit 1ff945d into main Jan 13, 2026
7 checks passed
@Qard Qard deleted the model-flexibility branch January 13, 2026 17:10

github-actions bot commented Jan 13, 2026

Braintrust eval report

Autoevals (main-1768324249)

| Score | Average | Improvements | Regressions |
| --- | --- | --- | --- |
| NumericDiff | 72.5% (-1pp) | 1 🟢 | 3 🔴 |
| Time_to_first_token | 1.34tok (+0.01tok) | 50 🟢 | 68 🔴 |
| Llm_calls | 1.55 (+0) | - | - |
| Tool_calls | 0 (+0) | - | - |
| Errors | 0 (+0) | - | - |
| Llm_errors | 0 (+0) | - | - |
| Tool_errors | 0 (+0) | - | - |
| Prompt_tokens | 279.25tok (+0tok) | - | - |
| Prompt_cached_tokens | 0tok (+0tok) | - | - |
| Prompt_cache_creation_tokens | 0tok (+0tok) | - | - |
| Completion_tokens | 19.3tok (+0tok) | - | - |
| Completion_reasoning_tokens | 0tok (+0tok) | - | - |
| Total_tokens | 298.54tok (+0tok) | - | - |
| Estimated_cost | 0$ (+0$) | - | - |
| Duration | 2.86s (-0.28s) | 105 🟢 | 111 🔴 |
| Llm_duration | 2.72s (+0.14s) | 30 🟢 | 89 🔴 |


Development

Successfully merging this pull request may close these issues.

How to use an Anthropic model for evals without unsetting OPENAI_API_KEY?
